# Japanese Speech Recognition
Parakeet Tdt Ctc 0.6b Ja
This model is a Japanese automatic speech recognition (ASR) model based on the FastConformer architecture, developed by NVIDIA and converted to MLX format.
Speech Recognition
mlx-community
368
1
Kotoba Whisper V2.2 Faster
MIT
This is a Japanese automatic speech recognition (ASR) model based on the Whisper architecture, converted to CTranslate2 format for improved inference efficiency.
Speech Recognition Japanese
RoachLin
99
1
Vlzcrz Whisper Small Japanese 2
Apache-2.0
A Japanese speech recognition model based on openai/whisper-small, fine-tuned on the Common Voice 17.0 dataset.
Speech Recognition
Transformers Japanese

vlzcrz
28
1
Japanese Wav2vec2 Large Rs35kh
Apache-2.0
A Japanese automatic speech recognition model based on the wav2vec 2.0 Large architecture, fine-tuned on the large-scale Japanese ASR corpus ReazonSpeech v2.0.
Speech Recognition
Transformers Japanese

reazon-research
244
1
Kotoba Whisper V2.0 Faster
MIT
A Whisper speech recognition model optimized for CTranslate2, specifically tailored for Japanese, providing efficient speech-to-text functionality.
Speech Recognition Japanese
kotoba-tech
202
14
Kotoba Whisper V2.1
Apache-2.0
Kotoba-Whisper-v2.1 is a Japanese automatic speech recognition (ASR) model based on Whisper, integrating an additional post-processing stack that automatically adds punctuation marks.
Speech Recognition
Transformers Japanese

kotoba-tech
2,589
16
Whisper Large V3 Japanese 4k Steps
Apache-2.0
A speech recognition model based on openai/whisper-large-v3, fine-tuned for 4,000 steps on the Common Voice 16.1 Japanese dataset.
Speech Recognition
Transformers Japanese

drewschaub
94
4
Nue Asr
Apache-2.0
Nue ASR is an end-to-end Japanese speech recognition model that integrates pre-trained speech and language models, offering high accuracy and fast recognition speed.
Speech Recognition
Transformers Supports Multiple Languages

rinna
722
24
Faster Whisper Large V2 Mix Jp
This is the CTranslate2-converted version of the whisper-large-v2-mix-jp model, suitable for Japanese speech recognition tasks.
Speech Recognition Japanese
arc-r
64
9
Faster Whisper Large V2 Japanese 5k Steps
MIT
A Japanese automatic speech recognition (ASR) model based on Whisper Large V2, optimized with CTranslate2 for efficient inference.
Speech Recognition
Transformers Japanese

zh-plus
280
18
Whisper Base Japanese
Apache-2.0
A Japanese speech recognition model fine-tuned from openai/whisper-base on the Common Voice, JVS, and JSUT datasets.
Speech Recognition
Transformers Japanese

Ivydata
137
3
Whisper Medium Jp
Apache-2.0
A Japanese speech recognition model based on openai/whisper-medium, fine-tuned on the common_voice_11_0 dataset.
Speech Recognition
Transformers Japanese

vumichien
4,542
25
Exp W2v2t Ja Vp It S544
Apache-2.0
A Japanese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-it-voxpopuli on the training set of Common Voice 7.0 (Japanese).
Speech Recognition
Transformers Japanese

jonatasgrosman
18
0
Exp W2v2t Ja Unispeech Sat S884
Apache-2.0
A Japanese automatic speech recognition model fine-tuned from microsoft/unispeech-sat-large on the Common Voice 7.0 Japanese dataset.
Speech Recognition
Transformers Japanese

jonatasgrosman
19
0
Exp W2v2t Ja Wavlm S729
Apache-2.0
A Japanese automatic speech recognition model fine-tuned from microsoft/wavlm-large on the Common Voice 7.0 Japanese dataset.
Speech Recognition
Transformers Japanese

jonatasgrosman
15
2
Exp W2v2t Ja Unispeech S569
Apache-2.0
A Japanese automatic speech recognition model fine-tuned from microsoft/unispeech-large-1500h-cv on the Common Voice 7.0 Japanese dataset.
Speech Recognition
Transformers Japanese

jonatasgrosman
14
0
Exp W2v2t Ja Xlsr 53 S109
Apache-2.0
A Japanese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice 7.0 Japanese dataset.
Speech Recognition
Transformers Japanese

jonatasgrosman
20
0
Wav2vec2 Xls R 1b Japanese
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-1b on public Japanese speech datasets, supporting automatic speech recognition tasks in Japanese.
Speech Recognition
Transformers Japanese

vumichien
50
2
Wav2vec2 Large Xlsr 53 Japanese
Apache-2.0
A Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53; it accepts audio input sampled at 16 kHz.
Speech Recognition Japanese
jonatasgrosman
2.9M
33
Wav2vec2 Xls R 300m Japanese
Apache-2.0
A Japanese automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m, designed to transcribe Japanese audio into hiragana text.
Speech Recognition
Transformers Japanese

vitouphy
29
0
W2v Hf Jsut Xlsr53
Apache-2.0
A Japanese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 using the Common Voice and JSUT datasets.
Speech Recognition
Transformers Japanese

qqpann
16
1
Wav2vec2 Large Xlsr Japanese
Apache-2.0
A fine-tuned model based on facebook/wav2vec2-large-xlsr-53 for Japanese speech recognition tasks.
Speech Recognition
Transformers Japanese

vumichien
214
5
Wav2vec2 Live Japanese
Apache-2.0
A Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, with support for hiragana output.
Speech Recognition
Transformers Japanese

ttop324
20
4
Wav2vec2 Xls R 300m Japanese
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on the Japanese Common Voice 8.0 dataset for Japanese speech-to-text.
Speech Recognition
Transformers Japanese

AndrewMcDowell
24
0
Wav2vec2 Large Japanese
A Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53; it accepts audio input sampled at 16 kHz.
Speech Recognition Japanese
NTQAI
316
7